On the doubt about margin explanation of boosting

نویسندگان

  • Wei Gao
  • Zhi-Hua Zhou
چکیده

Margin theory provides one of the most popular explanations to the success of AdaBoost, where the central point lies in the recognition that margin is the key for characterizing the performance of AdaBoost. This theory has been very influential, e.g., it has been used to argue that AdaBoost usually does not overfit since it tends to enlarge the margin even after the training error reaches zero. Previously the minimum margin bound was established for AdaBoost, however, Breiman [10] pointed out that maximizing the minimum margin does not necessarily lead to a better generalization. Later, Reyzin and Schapire [34] emphasized that the margin distribution rather than minimum margin is crucial to the performance of AdaBoost. In this paper, we show that previous margin bounds are special cases of the kth margin bound, and none of them is really based on the whole margin distribution. Then, we improve the empirical Bernstein bound given by Maurer and Pontil [28]. Based on this result, we defend the margin-based explanation against Breiman’s doubt by proving a new generalization error bound that considers exactly the same factors as Schapire et al. [35] but is uniformly tighter than Breiman [10]’s bound. We also provide a lower bound for generalization error of voting classifiers, and by incorporating factors such as average margin and variance, we present a generalization error bound that is heavily related to the whole margin distribution. Finally, we provide empirical evidence to verify our theory.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Refined Margin Analysis for Boosting Algorithms via Equilibrium Margin

Much attention has been paid to the theoretical explanation of the empirical success of AdaBoost. The most influential work is the margin theory, which is essentially an upper bound for the generalization error of any voting classifier in terms of the margin distribution over the training data. However, important questions were raised about the margin explanation. Breiman (1999) proved a bound ...

متن کامل

On the Margin Explanation of Boosting Algorithms

Much attention has been paid to the theoretical explanation of the empirical success of AdaBoost. The most influential work is the margin theory, which is essentially an upper bound for the generalization error of any voting classifier in terms of the margin distribution over the training data. However, Breiman raised important questions about the margin explanation by developing a boosting alg...

متن کامل

Margin Distribution Controlled Boosting

Schapire’s margin theory provides a theoretical explanation to the success of boosting-type methods and manifests that a good margin distribution (MD) of training samples is essential for generalization. However the statement that a MD is good is vague, consequently, many recently developed algorithms try to generate a MD in their goodness senses for boosting generalization. Unlike their indire...

متن کامل

Boosting the margin: A new explanation for the effectiveness of voting methods

One of the surprising recurring phenomena observed in experiments with boosting is that the test error of the generated classifier usually does not increase as its size becomes very large, and often is observed to decrease even after the training error reaches zero. In this paper, we show that this phenomenon is related to the distribution of margins of the training examples with respect to the...

متن کامل

Boosting as a Regularized Path to a Maximum Margin Classifier

In this paper we study boosting methods from a new perspective. We build on recent work by Efron et al. to show that boosting approximately (and in some cases exactly) minimizes its loss criterion with an l1 constraint on the coefficient vector. This helps understand the success of boosting with early stopping as regularized fitting of the loss criterion. For the two most commonly used criteria...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Artif. Intell.

دوره 203  شماره 

صفحات  -

تاریخ انتشار 2013